首页> 外文OA文献 >An Active Learning Approach for Jointly Estimating Worker Performance and Annotation Reliability with Crowdsourced Data
【2h】

An Active Learning Approach for Jointly Estimating Worker Performance and Annotation Reliability with Crowdsourced Data

机译:一种共同评估工人绩效的主动学习方法   和众包数据的注释可靠性

摘要

Crowdsourcing platforms offer a practical solution to the problem ofaffordably annotating large datasets for training supervised classifiers.Unfortunately, poor worker performance frequently threatens to compromiseannotation reliability, and requesting multiple labels for every instance canlead to large cost increases without guaranteeing good results. Minimizing therequired training samples using an active learning selection procedure reducesthe labeling requirement but can jeopardize classifier training by focusing onerroneous annotations. This paper presents an active learning approach in whichworker performance, task difficulty, and annotation reliability are jointlyestimated and used to compute the risk function guiding the sample selectionprocedure. We demonstrate that the proposed approach, which employs activelearning with Bayesian networks, significantly improves training accuracy andcorrectly ranks the expertise of unknown labelers in the presence of annotationnoise.
机译:众包平台提供了一种实用的解决方案,可以对负担得起的大型数据集进行可负担的注释以训练受监督的分类器。不幸的是,糟糕的员工绩效经常会威胁到注释的可靠性,并且每个实例都要求多个标签会导致成本大量增加而无法保证良好的结果。使用主动学习选择程序将所需的训练样本最小化可减少标记要求,但会通过集中于错误的注释而危害分类器训练。本文提出了一种主动学习的方法,在该方法中,工人的绩效,任务难度和注释可靠性被共同估计,并用于计算指导样本选择过程的风险函数。我们证明了所提出的方法,它与贝叶斯网络一起使用主动学习,可以显着提高训练准确性,并在存在注释噪声的情况下正确地对未知标签的专业知识进行排名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号